Control Task − Mapping Data − Migration Load − Balancing Job − Control
نویسنده
چکیده
In the course of the last few years, the user's interaction with parallel computer-systems has changed. A continuous growth in the number of interactive HPCapplications can be observed. When considering partitionable MPP-systems with exclusive usage of the physically separated regions, issues like the average waiting-time become more dominant for the users than the total system-throughput. In this paper, we focus on the problem of scheduling an arbitrary mixture of resource-requests for batch and interactive applications in an architecture-independent manner. To help users plan their daily work tight waiting-time estimations are indispensable. However, the resulting scheduling problem interferes with the problem of mapping requests onto certain MPParchitectures to reduce their internal fragmentations. We will show that this con ict can be alleviated by a distributed prover-veri er methodology. At rst, we will introduce the distributed resourcemanagement software CCS with its architectureindependent scheduling method. The message-based approach presented is used to verify the pre-calculated schedules with help of the system-dependent mapping instances. Simulations with the accounting data of our center have shown that tight waiting-time estimations can be made while the architecture-independent scheduling approach is still preserved. We will show that by using this methodology the mean error of the predicted waiting-time can be reduced by 76%. This work was partly supported by the German "Ministerium f ur Wissenschaft und Forschung" and the research cooperation NRW-Metacomputing. Further grants were provided by the "Stifterverband f ur die Deutsche Wissenschaft." Finally, we will discuss the impact of such a distributed resource-management system on the metacomputing challenge.
منابع مشابه
Comparative Study of Job Migration Algorithms for Autonomic Grid Management
Resources form the basis of a Grid. Resource management and load balancing are the main areas of concern in a distributed, heterogeneous and dynamic environment like Grid. Load balancing may further cause Job migration or in some cases re-submission of Job. In this paper a number of job migration algorithms have been surveyed and studied which have resulted because of the Load balancing problem...
متن کاملProvably Optimal Task Migration Time Using Alternative Routing Paths
Early experiments in migrating task images from one node in a metacomputing environment to another have shown, that it is possible to achieve almost maximum data throughput in LAN networks. With WAN-connected metacomputers however, the time spent in the task migration is not negligible. Here, alternative routing paths must be used concurrently to reduce the total migration time. We present a pr...
متن کاملAchieving Load Balancing of HDFS Clusters Using Markov Model
The combination of Hadoop and HDFS is becoming a defacto standard system in handling big data. HDFS is a distributed file system that is designed for big data. In HDFS, a file consists of multiple large sized blocks. A central management of HDFS tries to scatter these multiple blocks on different nodes to maximize the I/O throughput. Hadoop is a framework that supports data intensive parallel a...
متن کاملNetwork Load Balancing with Strong Migration in an Agent Based Grid System Using Csp Approach
In this paper we present a dynamic network traffic balancing approach using strong migration on an agent based grid environment. The paper focuses on three different areas, namely load balancing, strong mobility and CSP (Constraint Satisfaction Problem) approach. We use CSP approach for making task migration decisions. Strong task migration is used to move tasks between nodes at runtime to main...
متن کاملMARS - A framework for minimizing the job execution time in a metacomputing environment
Utilizing a collection of workstations and supercomputers in a metacomputing environment does not only o er an enormous amount of computing power, but also raises new problems. The true potential of WAN-based distributed computing can only be exploited if the application-to-architecture mapping re ects the di erent processor speeds, network performances and the application's communication chara...
متن کامل